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Abstract. We consider transactional memory contention management 
in the context of balanced workloads, where if a transaction is writing, 
the number of write operations it performs is a constant fraction of its 
total reads and writes. We explore the theoretical performance bound- 
aries of contention management in balanced workloads from the worst- 
case perspective by presenting and analyzing two new polynomial time 
contention management algorithms. The first algorithm Clairvoyant is 
0( v / s)-competitive, where s is the number of shared resources. This al- 
gorithm depends on explicitly knowing the conflict graph. The second 
algorithm Non-Clairvoyant is 0(y/s Tog n)-competitive, with high proba- 
bility, which is only a O(logn) factor worse, but does not require knowl- 
edge of the conflict graph, where n is the number of transactions. Both 
of these algorithms are greedy. We also prove that the performance of 
Clairvoyant is tight, since there is no polynomial time contention man- 
agement algorithm that is better than 0((y/s) _E )-competitive for any 
constant e > 0, unless NPCZPP. To our knowledge, these results are sig- 
nificant improvements over the best previously known O(s) competitive 
ratio bound. 



1 Introduction 

The ability of multi-core architectures to increase application performance de- 
pends on maximizing the utilization of the computing resources provided by 
them and using multiple threads within applications. These architectures present 
both an opportunity and challenge for multi-threaded software. The opportunity 
is that threads will be available to an unprecedented degree, and the challenge 
is that more programmers will be exposed to concurrency related synchroniza- 
tion problems that until now were of concern only to a selected few. Writing 
concurrent programs is a non-trivial task because of the complexity of ensuring 
proper synchronization. Conventional lock based synchronization (i.e., mutual 
exclusion) suffers from well known limitations, so researchers considered non- 
blocking transactions as an alternative. Herlihy and Moss [TH] proposed Trans- 
actional Memory (TM), as an alternative implementation of mutual exclusion, 
which avoids many of the drawbacks of locks, e.g., deadlock, reliance on the 
programmer to associate shared data with locks, priority inversion, and fail- 
ures of threads while holding locks. Shavit and Touitou [23] extended this idea 



to Software-only Transactional Memory (STM) by proposing a novel software 
method for supporting flexible transactional programming of synchronization 
operations |15I12I13| . 

A transaction consists of a sequence of read and write operations to a set of 
shared system resources (e.g. shared memory locations). Transactions may con- 
flict when they access the same shared resources. If a transaction T discovers that 
it conflicts with another transaction T' (because they share a common resource) , 
it has two choices, it can give T' a chance to commit by aborting itself, or it can 
proceed and commit by forcing T' to abort; the aborted transaction then retries 
again until it eventually commits. To solve the transaction scheduling problem 
efficiently, each transaction consults with the contention manager module for 
which choice to make. Dynamic STM (DSTM) [TS], proposed for dynamic-sized 
data structures, is the first STM implementation that uses a contention man- 
ager as an independent module to resolve conflicts between two transactions and 
ensure progress. Of particular interest are greedy contention managers where a 
transaction restarts immediately after every abort. As TM has been gaining 
attention, several (greedy) contention managers have been proposed in the lit- 
erature |2lllllOI5l21ll9j . which have been assessed formally and experimentally 
by specific benchmarks [20 . 

A major challenge in guaranteeing progress through transactional contention 
managers is to devise a policy which ensures that all transactions commit in the 
shortest possible time. The goal is to minimize the makespan which is defined as 
the duration from the start of the schedule, i.e., the time when the first transac- 
tion is issued, until all transactions commit. The makespan of the transactional 
scheduling algorithm can be compared to the makespan of an optimal off-line 
scheduling algorithm to provide a competitive ratio. The makespan and competi- 
tive ratio primarily depend on the workload — the set of transactions, along with 
their arrival times, duration, and resources they read and modify [3]. 

The performance of some of the contention managers has been analyzed for- 
mally in |3I2I11I10I2TT22] (the detailed description is given in Section OJ . The 
best known formal bound is provided in [2] where the authors give an O(s) 
competitive ratio bound, where s is the number of shared resources. When the 
number of resources s increases, the performance degrades linearly. A difficulty 
in obtaining better competitive ratios is that the scheduling problem of n con- 
current transactions is directly related to the vertex coloring problem which is a 
hard problem to approximate |17j . A natural question which we address here is 
whether it is possible to obtain better competitive ratios. As we show below, it is 
indeed possible to obtain sub-linear competitive ratios for balanced transaction 
workloads. 

1.1 Contributions 

In this paper, we study contention management in the context of balanced work- 
loads which have better performance potential for transactional memory. A bal- 
anced workload consists of a set of transactions in which each transaction has the 



following property: if the transaction performs write operations, then the num- 
ber of writes it performs is a constant fraction of the total number of operations 
(read and writes) of the transaction. The balancing ratio /3 expresses the ratio 
of write operations of a transaction to the overall operations of the transaction. 
The balancing ratio is bounded as = < j3 < 1, since a writing transaction writes 
to at least one resource. In balanced workloads (3 = 0(1) for all the transactions 
which perform writes. Balanced workloads can also include read-only transac- 
tions, but we assume that there is at least one transaction that performs writes, 
since otherwise the scheduling problem is trivial (no conflicts). 

Balanced transaction workloads represent interesting and practical transac- 
tion memory scheduling problems. For example balanced workloads represent 
the case where we have small sized transactions each accessing a small (con- 
stant) number of resources, where trivially (3 — 0(1). Other interesting sce- 
narios are transaction workloads which are write intensive, where transactions 
perform many writes, as for example in scientific computing applications where 
transactions have to update large arrays. 

We present two new polynomial time contention management algorithms 
which are especially tailored for balanced workloads and analyze their theoretical 
performance boundaries from the worst-case perspective. The first algorithm, 

called Clairvoyant, is O {^t ■ ■\ffj -competitive where s is the number of shared 

resources, and I expresses the logarithm ratio of the longest to shortest execution 
times of the transactions. (The transaction execution time is the time it needs 
to commit uninterrupted from the moment it starts.) For balanced transaction 
workloads where f3 = 0(1), and when transaction execution times are close 
to each other, i.e. I = 0(1), Algorithm Clairvoyant is 0( v / s)-competitive. This 
algorithm is greedy and has the pending commit property (where at least one 
transaction executes uninterrupted each time). However, it depends on assigning 
priorities to the transactions based on the explicit knowledge of the transaction 
conflict graph which evolves while the execution of the transactions progresses. 
It also assumes that each transaction knows how long is its execution time and 
how many resources it accesses. 



The second algorithm, called Non-Clairvoyant, is I £ ■ . /-§ • logn 



competitive, with high probability (at least 1 — —), where n is the number 
of transactions concurrently executing in n threads. For balanced transaction 
workloads, where (3 — 0(1), and when transaction execution times are close to 
each other, i.e. I — 0(1), Algorithm Non-Clairvoyant is 0(y / s-logn)-competitive. 
This is only a 0(log n) factor worse than Clairvoyant, but does not require explicit 
knowledge of the conflict graph. The algorithm is also greedy. This algorithm 
uses as a subroutine a variation of the RandomizedRounds scheduling algorithm 
by Schneider and Wattenhofer [21] which uses randomized priorities and doesn't 
require knowledge of the conflict graph. 

The 0(y/s) bound of Algorithm Clairvoyant is actually tight. Through a re- 
duction from the graph coloring problem, we show that it is impossible to ap- 
proximate in polynomial time any transactional scheduling problem with (3 = 1 




and £ — 1 with a competitive ratio smaller than 0((y / s) 1_e ) for any constant 
e > 0, unless NPCZPP. To our knowledge, these results are significant improve- 
ments over the best previously known bound of O(s) for transactional memory 
contention managers. For general workloads (including non-balanced workloads), 
where transactions are equi-length (£ = 0(1)), our analysis gives 0(s) compet- 
itive worst case bound, since (3 > l/s. This bound matches the best previously 
known bound of O(s) for general workloads. The parametrization of that we 
provide gives more tradeoffs and flexibility for better scheduling performance, as 
depicted by the performance of our algorithms in balanced workloads. 

1.2 Related Work 

Almost 10 year after publishing the seminal paper [16 to introduce the new 
research area of transactional memory, Herlihy et al. [15] proposed Dynamic 
STM (DSTM) for dynamic-sized data structures. Later on, several other STM 
implementations have been proposed, such as TL2 [4] , TinySTM [8] , and RSTM 
[T5] to name a few. Among them, DSTM is the first practical obstruction-fre^H 
implementation that seeks advice from the contention manager module to either 
wait or abort a transaction at the time of conflict. 

Several contention managers have been proposed in STM and the perfor- 
mance of some of them has been analyzed formally in |3|2|11|10|21|22] . The first 
formal analysis of the performance of a contention manager is given by Guerraoui 
et al. [IT] where they present the Greedy contention manager which decides in fa- 
vor of older transactions using timestamps and achieves 0(s 2 ) competitive ratio. 
This bound holds for any algorithm which ensures the pending commit property 
(see Definition [lj. Attiya et al. [2] improve the competitive ratio to O(s), and 
prove a matching lower bound of Q(s) for any deterministic work-conserving 
algorithm which schedules as many transactions as possible (by choosing a max- 
imal independent set of transactions) . The model in [5] is non-clairvoyant in the 
sense that it requires no prior knowledge about the transactions while they are 
executed. 

Schneider and Wattenhofer [2TJ present a deterministic algorithm Com m it- 
Bounds with competitive ratio 0(s) and a randomized algorithm Randomize- 
dRounds with makespan O(Clogn) with high probability, for a set of n trans- 
actions, where C denotes the maximum number of conflicts among transactions 
(assuming unit execution time durations for transactions). Sharma et al. [22j 
study greedy contention managers for M x N execution windows of transac- 
tions with M threads and N transactions per thread and present and analyze 
two new randomized greedy contention management algorithms. Their first al- 
gorithm Offline-Greedy produces a schedule of length 0(r max • (C + N \og(M N))) 
with high probability, where r max is the execution time duration of the longest 
transaction in the system, and the second algorithm Online-Greedy produces a 
schedule of length O (r max ■ (C \og(M N) + N log 2 (MN))). The competitiveness of 

A synchronization mechanism is obstruction-free if any thread that runs for a long 
time it eventually makes progress [14| . 



both of the algorithms is within a poly-log factor of O(s). Another recent work 
is Serializer [5] which resolves a conflict by removing a conflicting transaction T 
from the processor core where it was running, and scheduling it on the processor 
core of the other transaction to which it conflicted with. It is 0(n)-competitive 
and in fact, it ensures that two transactions never conflict more than once. 

TM schedulers [31612411] offer an alternative approach to boost the TM per- 
formance. A TM scheduler is a software component which decides when a partic- 
ular transaction executes. One proposal in this approach is Adaptive Transaction 
Scheduling (ATS) [24 which measures adaptively the contention intensity of a 
thread, and when the contention intensity increases beyond a threshold it serial- 
izes the transactions. The Restart and Shrink schedulers, proposed by Dragojevic 
et al. [6], depend on the prediction of future conflicts and dynamically serialize 
transactions based on the prediction to avoid conflicts. The ATS, Restart, and 
Shrink schedulers are 0(n)-competitive. Steal-On-Abort [I] is yet another pro- 
posal where the aborted transaction is given to the opponent transaction and 
queued behind it, preventing the two transactions from conflicting again. 

Recently, Attiya et al. 3 proposed the BIMODAL scheduler which alternates 
between writing epochs where it gives priority to writing transactions and read- 
ing epochs where it gives priority to transactions that have issued only reads so 
far. It achieves O(s) competitive ratio on bimodal workloads with equi-length 
transactions. A bimodal workload contains only early-write and read-only trans- 
actions. 

Outline of Paper. The rest of the paper is organized as follows. We present our 
TM model and definitions in Section [5J We present and formally analyze two 
new randomized algorithms, Clairvoyant and Non-Clairvoyant, in Sections |3] and 
|4l respectively. The hardness result of balanced workload scheduling is presented 
in Section [5j Section [6] concludes the paper. 

2 Model and Definitions 

Consider a system of n > 1 threads V = {Pi, • • • , P n } with a finite set of s shared 
resources 1Z = {Ri, ■ ■ . , R s }. We consider batch execution problems, where the 
system issues a set of n transactions T = {Ti, • • • ,T n } {transaction workload), 
one transaction Tj per thread Pi. Each transaction is a sequence of actions 
(operations) each of which is either a read or write to some shared resource. 
The sequence of operations in a transaction must be atomic: all operations of a 
transaction are guaranteed to either completely occur, or have no effects at all. 
A transaction that only reads shared resources is called read-only; otherwise it 
is called a writing transaction. We consider transaction workloads where at least 
one transaction is writing. 

After a transaction is issued and starts execution it either commits or aborts. 
A transaction that has been issued but not committed yet is said to be pending. 
A pending transaction can restart multiple times until it eventually commits. 
Concurrent write- write actions or read-write actions to shared objects by two or 



more transactions cause conflicts between transactions. If a transaction conflicts 
then it either aborts, or it may commit and force to abort all other conflicting 
transactions. In a greedy schedule, if a transaction aborts due to conflicts it 
then immediately restarts and attempts to commit again. We assume that the 
execution time advances synchronously for all threads and a preemption and 
abort require negligible time. We also assume that all transactions in the system 
are correct, i.e., there are no faulty transactions 

Definition 1 (Pending Commit Property [llj). A contention manager 
obeys the pending commit property if, whenever there are pending transactions, 
some running transaction T will execute uninterrupted until it commits. 

Let H(Ti) denote the set of resources used by a transaction T{. We can write 
lZ(Ti) = lZ w (Ti) U lZ r (Ti), where 1Z w (Ti) are the resources which are to be 
written by Ti, and lZ r (Ti) are the resources to be read by Tj. 

Definition 2 (Transaction Conflict). Two transactions Ti andTj conflict if 
at least one of them writes on a common resource, that is, there is a resource R 
such that R 6 (7l w (Ti) PilZ(Tj)) U (7£(Tj) (~}lZ w (Tj)) (we also say that R causes 
the conflict). 

From the definition of transaction conflicts we can define the conflict graph for 
a set of transactions. In the conflict graph, each node corresponds to a transaction 
and each edge represents a conflict between the adjacent transactions. 

Definition 3 (Conflict Graph). For a set of transactions T, the conflict 
graph G(T) — (V, E) has as nodes the transactions, V = T, and (Ti,Tj) G E 
for any two transactions Ti,Tj that conflict. 

Let j(Rj) denote the number of transactions that write resource Rj. Let 
7max = maxjj(Rj). Denote X w (Ti) = \U w {Ti)\, A r (T^) = \R r (T.i)\, and A(T;) = 
|72.(Tj)|, the number of resources which are being accessed by transaction Tj for 
write, read, and both read and write. Let A max = max^ A(Tj). Note that in the 
conflict graph G the maximum node degree is bounded by A max • 7 ma x, and also 
there is a node whose degree is at least 7 max - 

For any transaction Tj we define the balancing ratio /3(2~i) = ^-i^j as the 
ratio of number of writes versus the total number of resources it accesses. For a 
read-only transaction /3(T,) = 0. For a writing transaction it holds - < /3(Tj) < 
1, since there will be at least one write performed by Ti to one of the s resources. 
We define the global balancing ratio as the minimum of the individual writing 
transaction balancing ratios: ft = min(T ( eT)A(A w (T ( )>o) P{Ti). We define balanced 
transaction workloads as follows (recall that we consider workloads with at least 
one writing transaction): 

2 A transaction is called faulty when it encounters an illegal instruction producing a 
segmentation fault or experiences a page fault resulting to wait for a long time for 
the page to be available [TO] . 



Definition 4 (Balanced Workloads). We say that a workload (set of trans- 
actions) T is balanced if ft = O(l). 



In other words, in balanced transaction workloads the number of writes that 
each writing transaction performs is a constant fraction of the total number of 
resource accesses (for read or write) that the transaction performs. 

Each transaction J 1 , has execution time duration > 0. The execution time is 
the total number of discrete time steps that the transaction requires to commit 
uninterrupted from the moment it starts. In our model we assume that the 
execution time of each transaction is fixed. Let T max = max^ r,; be the execution 
time of the longest transaction, and r m i n = min^ n be the execution time of the 



shortest transaction. We denote £ 



1. We finish this section 



with the basic definitions of makespan and competitive ratio. 

Definition 5 (Makespan and Competitive Ratio). Given a contention 
manager A and a workload T, makespan a(T) is the total time A needs 
to commit all the transactions in T . The competitive ratio is CRa{T) = 
makespan a (T) where opt is the optimal off-line scheduler. 

makespan op t{ I ) 7 r JJ 

3 Clairvoyant Algorithm 

We describe and analyze Algorithm Clairvoyant (see Algorithm [T]) . The writ- 
ing transactions are divided into £ groups Aq, A\, . . . , A^\, where £ = 
log ^ ^°"' x ^ + 1, in such a way that Ai contains transactions with execution 

time duration in range [2* • T m j n , (2 I+1 — 1) • r m j n ], for < i < t— 1. Each group of 
transactions Ai is then again divided into K subgroups A®, Aj, . . . , A^ _1 , where 
k — [logs] + 1, such that each transaction T G Aj accesses (for read and write) 
a number of resources in range A(T) £ [2 J , 2 J+1 — 1], for < j < k — 1. We assign 
an order to the subgroups in such a way that A\ < A l k if i < k or i = k A j < I. 
Note that some of the subgroups may be empty. The read-only transactions are 
placed into a special group B which has the highest order. 

At any time t the pending transactions are assigned a priority level which de- 
termines which transactions commit or abort. A transaction is assigned a priority 
which is one of: high or low. Let 7J t h and TI l t denote the set of transactions which 
will be assigned high and low priority, respectively, at time t. In conflicts, high 
priority transactions abort low priority transactions. Conflicts between trans- 
actions of the same priority level are resolved arbitrarily. Suppose that A t is 
the lowest order subgroup that contains pending transactions at time t. Only 
transactions from A t can be given high priority, that is n' t l C A t . 

The priorities are determined according to the conflict graph for the trans- 
actions. Let 7* denote the set of all transactions which are pending at time t. 
(Initially, To = T ) Let % denote the pending transactions of A t at time t. (Ini- 
tially, % — Aq.) Let St denote the set of transactions in % which are pending 
and have started executing before t but have not yet committed or aborted. Let 
S[ denote the set of transactions in Tt which conflict with St . Let It be a maximal 



Algorithm 1: Clairvoyant 



Input: A set T of n transactions with global balancing ratio (3; 
Output: A greedy execution schedule; 

- Divide writing transactions into £ = [log(-p as2 ')] + 1 groups A , Ai, ■ ■ ■ , A e -i in 
such a way that Ai contains transactions with execution time duration in range 
[2 l • Tmin, (2 l+1 — 1) • T m in] ; Read-only transactions are placed in special group B; 

- Divide Ai again into k = [log s] + 1 subgroups A°, A} , ■ ■ ■ , A^ 1 in a way that 
each subgroup AI contains transactions that access a number of resource in the 
range [2 j ,2 J+1 - 1]; 

- Order the groups and subgroups such that A? < A l k if i < k or i = k A j < I; 
special group B has highest order; 

foreach time step t = 0, 1, 2, 3, . . . do 
Set Definitions: 

7t: set of transactions that are pending; // To 4— T 

A t : lowest order group that contains pending transactions; 

Tf. set of transactions in A t which are pending; // 7o Aq 

St- set of transactions in 7t which were started before t; 

S' t : set of conflicting transactions in Tt which conflict with St ; 

It : maximal independent set in the conflict graph G(Tt \ S' t ); 
Priority Assignment: 

High priority transactions: 7T t ft I t U S't; 

Low priority transactions: Il\ <— Tt\ lit; 
Conflict Resolution: 

Execute all pending transactions; 

On conflict of transaction T u with transaction T v : 

if (T u e 11?) A (T v e n\) then abort(T u ,T v ); else abort(T v ,T u ); 



independent set in the conflict graph G(7t \ S' t ). Then, the set of high priority 
transactions at time t is set to be = I t U St- The remaining transactions are 
given low priority, that is, 77 t ' = % \ 77 t '\ Note that the transactions in 77^ do 
not conflict with each other. The transactions TT/ 1 will remain in high priority 
in subsequent time steps t' > t until they commit, since the transactions in S t > 
are included in 77^. 

This algorithm is clairvoyant in the sense that it requires explicit knowledge 
of the various conflict relations at each time t. The algorithm is greedy, since at 
each time step each pending transaction is not idle. The algorithm also satisfies 
the pending commit property since at any time step t at least one transaction 
from A t will execute uninterrupted until it commits. We have assumed above 
that each transaction knows its execution length and the number of resources it 
accesses. Clearly, the algorithm computes the schedule in polynomial time. 



3.1 Analysis of Clairvoyant Algorithm 



We now give a competitive analysis of Algorithm Clairvoyant. Define r^ in = 
2 J • T m i n and r max = (2 l+1 — 1) • T m ; n . Note that the duration of each transaction 
T £ A\ is in range [r^ in ,T max ], and also T^ ax < 2r^ in . Define A^ lin = V and 
A^ ax = 2 3+1 - 1. Note that for each transaction T £ A\, A(T) £ [A^ in ,A^ ax ], 
and A4 ax < 2A^ in . Let 7^ (ii„) denote the number of transactions in a subgroup 
A\ that write 1 < v < s. Let 7 max = 77iax ie [ M ] !t , e [ M ]7f 

In the next results we will first focus on a subgroup A\ and we will assume 
that there are no other transactions in the system. We give bounds for the com- 
petitive ratio for Aj which will be useful when we later analyze the performance 
for all the transactions in T ■ 

Lemma 1. If we only consider transactions in subgroup A\ 7 then the competitive 
ratio is bounded by C 'RciairvoyantiAj) < 2 • A^ nax + 2. 

Proof. Since there is only one subgroup, A t — Aj. A transaction T £ A\ conflicts 
with at most A^ ax ■ 7 max other transactions in the same subgroup. If transaction 
T is in low priority it is only because some other conflicting transaction in 
A\ is in high priority. If no conflicting transaction is in high priority then T 
becomes high priority immediately. Since a high priority transaction executes 
uninterrupted until it commits, it will take at most A^ax ' 7max time steps until 
all conflicting transactions with T have committed. Thus, it is guaranteed that 
in at most X J max • 7 max ' T max time steps T becomes high priority. Therefore, T 
commits by time (A^ ax • 7 max + 1) • 7" max - Since T is an arbitrary transaction in 
A\ , the makespan of the algorithm is bounded by: 

makespanciairvoyantiAi) < (A^ ax • 7 ^ ax + 1) • r max . 

There is a resource that is accessed by at least 7 max transactions of Aj for write. 
All these transactions have to serialize because they all conflict with each other 
in the common resource. Therefore, the optimal makespan is bounded by: 



makespan op t{Al) > 7 



3 . T 3 
max ' min ' 



When we combine the upper and lower bounds we obtain a bound on the 
competitive ratio of the algorithm: 

nT} , A u _ makespanciairvXAf) (A max • 

CrtClairvAAJ = < < 2 ■ A^ lax + 2. 

makespan opt {Aj) 7 max • r J min 

Lemma 2. If we only consider transactions in subgroup Aj, then the competitive 
ratio is bounded by C ' Rciairvoyant{A\) < 4 • 

Proof. Since the algorithm satisfies the pending-commit property, if a transac- 
tion T e Aj does not commit, then some conflicting transaction T' £ A\ must 
commit. Therefore, the makespan of the algorithm is bounded by: 

makespan C iairvoyant(A\) <\Aj\ 



Each transaction in T G A\ accesses at least X W (T) resources for write. Since 
we only consider transactions in A{, X W (T) > (3-\ J miu > /?-A^ lax /2. Consequently, 
by the pigeonhole principle, there will be a resource R G 1Z which is accessed by 
at least J2reA j ^w(T)/s > \A^\ ■ f3 ■ A^ lax /(2s) transactions for write. All these 
transactions accessing R have to serialize because they conflict with each other. 
Therefore, the optimal makespan is bounded by: 



makespan opt {4) > l ^ l '^ A " iax • 



min ' 



When we combine the above bounds of the makespan we obtain the following 
bound on the competitive ratio of the algorithm: 

ru (AJ\- makespancia irV oyant(Al) |Af | • r^ ax s/fi 

makespan opt {A J i ) \ A i I P max . -J . \ J ma ^ 

^ 2s min 

From Lemmas [T] and [21 we obtain: 

Corollary 1. If we only consider transactions in subgroup A J -, then the 
competitive ratio of the algorithm is bounded by C 'Rciairvoyanti-^-D < 4 • 
min|A^ ax , jr^j- 

We now continue to provide a bound for the performance of individual groups. 
This will help to provide bounds for all the transactions. 

Lemma 3. If we only consider transactions in group Aj, then the competitive 
ratio of the algorithm is bounded by C Rciairvoyant(Ai) < 32 • y^f- 

Proof. Since Af nax = (2 J+1 — 1), Corollary [1] gives for each subgroup A\ compet- 
itive ratio 

CRciairvoyanMi) < 4 • min - 1, } < 8 ' min ^} • 

Let V = losis 2 /f3) . Note that min^,^} < 2*, Vj G [0, [i/>\]; and 

minj^',^} < = 2 2 ^' , Vj G [|_^J + 1,/c- 1]. Group A* contains k 
subgroups of transactions. In the worst case, Algorithm Clairvoyant will commit 
the transactions in each subgroup according to their order starting from the low- 
est order subgroup and ending at the highest order subgroup, since that's the 
order that the transactions are assigned a high priority. Therefore, 

K-l 

CRci air v. 

= / ^,CRci a i rv .(Al) + ^ CRciairv.(M) 
3=0 J=W+1 



/ W-J fe-i 



) 




<8- 2 2 



< 8 • (2 • 2^ + 2 • 2^) = 32 • 



\3=0 i=LV>J+i 



Theorem 1 (Competitive Ratio of Clairvoyant). For set of transactions T, 
Algorithm Clairvoyant has competitive ratio CRciairvoyant(T) = O [£• . /-§ j . 



Proof. As there are £ groups of transactions Ai, and one group B, in the worst 
case, Algorithm Clairvoyant will commit the transactions in each group according 
to their order starting from the lowest order group and ending at the highest 
order group. Clearly, the algorithm will execute the read-only transactions in 
group B in optimal time. Therefore, using Lemma El we obtain: 



The corollary below follows immediately from Theorem [T] 

Corollary 2 (Balanced Workload). For balanced workload T ((3 = 0(1) ) and 
when £ = 0(1), Algorithm Clairvoyant has competitive ratio C ' Rciairvoyant(T) = 



4 Non- Clairvoyant Algorithm 

We present and analyze Algorithm Non-Clairvoyant (see Algorithm^. This al- 
gorithm is similar to Clairvoyant given at Section [3] with the difference that the 
conflicts are resolved using priorities which arc determined without the explicit 
knowledge of the conflict graph. 

Similar to Algorithm Clairvoyant, the transactions are organized into groups 
and subgroups. Lower order subgroups have always higher priority than higher 
order subgroups. At each time step t, let A t denote the lowest order subgroup. 
Clearly, the transactions in A t have higher priority than the transactions in all 
other subgroups, and in case of conflicts only the transactions in A t win. When 
transactions in the same subgroup conflict, the conflicts are resolved according 
to random priority numbers. When a transaction starts execution it chooses 
uniformly at random a discrete number r(T) € [l,n]. In case of a conflict of 
transaction T w with another transaction T x in the same subgroup with r(T x ) < 
r(T w ), then T x aborts T w , and otherwise T w aborts T x . When transaction T w 
restarts, it cannot abort T x until T x has been committed or aborted. After 
every abort, the newly started transaction chooses again a new discrete number 
uniformly at random in the interval [1, n]. The idea of randomized priorities has 
been introduced originally by Schneider and Wattenhofer |21| in their Algorithm 
Randomized Rounds. 






O(Vi). 



Algorithm 2: Non-Clairvoyant 



Input: A set T of n transactions with global balancing ratio /3; 
Output: A greedy execution schedule; 

- Divide transactions into I = [log( ^ max )] + 1 groups Aq, Ai, • • • , Ag-\ in such a 
way that Ai contains transactions with execution time duration in range 

[2 l ■ T m in, (2 l+1 — 1) • r m in]; Read-only transactions are placed in special group B; 

- Divide Ai again into k = [log s] + 1 subgroups A®, A} , ■ ■ ■ , A1~ x in a way that 
each subgroup A\ contains transactions that access a number of resource in the 
range [2 J ,2 J+1 - 1]; 

- Order the groups and subgroups such that A\ < A^ if i < k or i = k A j < I; 
special group B has highest order; 

foreach time step t = 0, 1, 2, 3, . . . do 

Execute all pending transactions; // at t — issue all transactions 
On (re)start of transaction T: 

r(T) <— random integer in [1, n]; 
On conflict of transaction T u £ A\ with transaction T v G A k : 
if A\ < A l k then abort (T U ,T V ); 
else if A{ > A\ then abort(T v ,T u ); 

else if r(T u ) < r(T v ) then abort(T u ,T v ) ; // The case A\ = A l k 

else abort (T V ,T U ); 

II In case a transaction T u aborts T v because r(T u ) <r(T v ), 
then when T v restarts it cannot abort T u until T u 
commits or aborts 



This algorithm is non-clairvoyant in the sense that it does not depend on 
knowing explicitly the conflict graph to resolve conflicts. The algorithm is greedy 
but does have the pending commit property. The groups and subgroups can be 
implemented in the algorithm since we assume that each transaction knows 
its execution time and the number of resources that it accesses. Clearly, the 
algorithm computes the schedule in polynomial time. 

4.1 Analysis of Non- Clairvoyant Algorithm 

In the analysis given below, we study the properties of Algorithm Non-Clairvoyant 
and give its competitive ratios. We use the following adaptation of the response 
time analysis of Algorithm RandomizedRounds given in [21]. It uses the following 
Chernoff bound: 

Lemma 4 (Chernoff Bound). Let X\, X2, ■ ■ ■ , X n be independent Poisson tri- 
als such that, for 1 < i < n, Pr(Xi = 1) = pri, where < pri < 1. Then, for 
x = E7=i x i, M = E[A] = E?=iP r i, and any < S < l,Pr(A < (1 - 6)fi) < 

Lemma 5 (Adaptation from Schneider and Wattenhofer |21j). Given 
a transaction scheduling problem with n concurrent transactions, where each 



transaction has execution time at most r, the time span a transaction T needs 
from the moment it is issued until commit is 16 -e- (dr+1) -rdnn with probability 
at least 1 — where dx is the number of transactions conflicting with T. 

Proof. Consider the respective conflict graph G of the problem with the n trans- 
action. Let Nt denote the set of conflicting transactions for T (these are the 
neighbors of T in G). Let r(T) denote the random priority number choice of T 
in range [l,n]. The probability that for transaction T no transaction T 1 £ Nt 
has the same random number is: 

PrtfT' € N T \r(T) = r(T')) = (l - i Y > U - >\- 

The probability that r(T) is at least as small as r(T') for any transaction T 1 G 
Nt is ■ Thus, the chance that r(T) is smallest and different among all its 
neighbors in Nt is at least e .^ +1 ^ ■ If we conduct 16 • e • (dx + 1) • Inn trials, 
each having success probability e .^ +1 ) , then the probability that the number 
of successes Z is less than 8 Inn becomes: Vr(Z < 8 ■ Inn) < e - 21n ™ — 1/n 2 , 
using the Chernoff bound of Lemma 2] Since every transaction has execution 
time at most r, the total time spent until a transaction commits is at most 
16 • e • {dr + 1) • r • Inn, with probability at least 1 — 1/n 2 . 

We now give competitive bounds for some subgroup A\ and later extend the 
results to all the transactions in T. The proofs are similar as in the analysis of 
Algorithm Clairvoyant and can be found in the appendix. 

Lemma 6. If we only consider transactions in subgroup A\, then the competitive 
ratio is bounded by C RNon-Clairvoyant{A\) < 64 • e • A^ax • Inn with probability 
at least 1 — ^iX, 

Lemma 7. If we only consider transactions in subgroup A\ , then the competitive 
ratio is bounded by C 'R^ on-ciairvoyant{A{) < 64 • e • ^ ■ Inn with probability 

at least 1 — —i^. 

From Lemmas [S] and [71 we obtain: 

Corollary 3. If we only consider transactions in subgroup A\, then the com- 
petitive ratio of the algorithm is bounded by CRNon-ciairvoyant(Aj) < 64 • e • 

min |A^ nax , ^ \ ■ Inn with probability at least 1 



We now provide a bound for the performance of individual groups which will 
help to provide bounds for all the transactions. 

Lemma 8. If we only consider transactions in group Ai, then the competitive 
ratio of the algorithm is bounded by CRNon-ciairvoyant(Ai) < 512 • e • yjj ' l nn 

with probability at least 1 — 



Theorem 2 (Competitive Ratio of Non-Clairvoyant). For a set 

of transactions T, Algorithm Non-Clairvoyant has competitive ratio 

C 'Rn on-ciairvoyantiT) = O (i ■ ■ log nj with probability at least 1 — i. 

The corollary below follows immediately from Theorem [2] 

Corollary 4 (Balanced Workload). For balanced workload T (j3 = 0(1)) 
and when £ = 0(1), Algorithm Non-Clairvoyant has competitive ratio 
CR Non -ciairvoyant(T) = 0{yfs • logra) with probability at least 1 - K 



5 Hardness of Balanced Transaction Scheduling 

In this section, we show that the performance of Clairvoyant is tight by reducing 
the graph coloring problem to the transaction scheduling problem. 

A Vertex Coloring problem instance asks whether a given graph G is 
fc-colorable [9]. A valid fc-coloring is an assignment of integers {1, 2, • • • , k} (the 
colors) to the vertices of G so that neighbors receive different integers. The 
chromatic number, x(G) is the smallest k such that G has a valid fc-coloring. 
We say that an algorithm approximates x(G) with approximation ratio q(G) 
if it outputs u(G) such that x(G) < u(G) and u(G)/x(G) < q(G). Typically, 
q(G) is expressed only as a function of n, the number of vertices in G. It is 
well known that known Vertex Coloring is NP-complete. It is also shown 
in [7] that unless NPCZPP, there does not exist a polynomial time algorithm 
to approximate x(G) with approximation ratio 0(n 1_e ) for any constant e > 0, 
where n denotes the number of vertices in graph G. 

A Transaction Scheduling problem instance asks whether a set of trans- 
actions T with a set of resources TZ has makespan k time steps. We give a poly- 
nomial time reduction of the Vertex Coloring problem to the Transaction 
Scheduling problem. Consider an input graph G = (V, E) of the Vertex Col- 
oring problem, where |V| = n and \E\ = s. We construct a set of transactions 
T such that for each v e V there is a respective transaction T v 6 T; clearly, 
\T\ = \V\ = n. We also use a set of resources TZ such that for each edge e € V 
there is a respective resource R e G TZ; clearly, \TZ\ = \E\ = s. If e = (u, v) € E, 
then both the respective transactions T u and T v use the resource R e for write. 
Since all transaction operations are writes, we have that j3 = 1. We take all the 
transactions to have the same execution length equal to one time step, that is, 

Tmax — T min — 1; and I = 1. 

Let G' be the conflict graph for the transactions T ■ Note that G' is isomorphic 
to G. Node colors in G correspond to time steps in which transactions in G' are 
issued. Suppose that G has a valid fc-coloring. If a node v £ G has a color x, then 
the respective transaction T v £ G' can be issued and commit at time step x, 
since no conflicting transaction (neighbor in G') has the same time assignment 
(color) as T v . Thus, a valid fc-coloring in G implies a schedule with makespan 
k for the transactions in T. Symmetrically, a schedule with makespan k for T 
implies a valid fc-coloring in G. 



It is easy to see that the problem TRANSACTION SCHEDULING is in NP. 
From the reduction of the Vertex Coloring problem, we also obtain that 
Transaction Scheduling is TVP-complete. 

From the above reduction, we have that an approximation ratio q(G) of 
the Vertex Coloring problem implies the existence of a scheduling algo- 
rithm A with competitive ratio CR_a(T) = q(G) of the respective Transac- 
tion Scheduling problem instance, and vice- versa. Since s = \TZ\ — \E\ < n 2 , 
an (-\/s) 1_e competitive ratio of A implies at most an n 1_e approximation ra- 
tio of Vertex Coloring. Since, we know that unless NPCZPP, there does 
not exist a polynomial time algorithm to approximate x(G) with approxima- 
tion ratio 0(n 1_e ) for any constant e > 0, we obtain a symmetric result for the 
Transaction Scheduling problem: 

Theorem 3 (Approximation Hardness of Transaction Scheduling). Un- 
less NPCZPP, we cannot obtain a polynomial time transaction scheduling algo- 
rithm such that for every input instance with ft = 1 and £ = 1 of the TRANS- 
ACTION Scheduling problem the algorithm achieves competitive ratio smaller 
than O^y^s) 1-6 ) for any constant e > 0. 

Theorem [3] implies that the 0(y/s) bound of Algorithm Clairvoyant, given in 
Corollary [2] for /3 = 0(1) and i = 0(1), is tight. 

6 Conclusions 

We have studied the competitive ratios achieved by transactional contention 
managers on balanced workloads. The randomized algorithms presented in this 
paper allow to achieve best competitive bound on balanced workloads. We also 
establish hardness results on the competitive ratios in our balanced workload 
model by reducing the well known NP-complete vertex coloring problem to the 
transactional scheduling problem. 

There are several interesting directions for future work. As advocated in [15] , 
our algorithms are conservative — abort at least one transaction involved in 
a conflict — as it reduces the cost to track conflicts and dependencies. It is 
interesting to look whether the other schedulers which are less conservative can 
give improved competitive ratios by reducing the overall makespan. First, our 
study can be complemented by studying other performance measures, such as 
the average response time of transactions under balanced workloads. Second, 
while we have theoretically analyzed the behavior of balanced workloads, it is 
interesting to see how our contention managers compare experimentally with 
prior transactional contention managers, e.g., (51241 1111] . 
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A Proofs of Section 3] 



Proof of Lemma [6| 

Proof. Since there is only one subgroup, a transaction T E A- conflicts with at 
most dx < A4 ax • 7i ax other transactions in the same subgroup. From Lemma 

it will take at most x = 16 • e • (A^ lax • 7 I J na x + 1) ' T max ' mn time steps until T 
commits, with probability at least 1 — -=j. Considering now all the transactions 

in A? , and taking the union bound of individual event probabilities, we have 

that all the transactions in A\ commit within time x with probability at least 

I A? I lA^I 

1 — — Therefore, with probability at least 1 — the makespan is bounded 

by: * 

(A?) < 16 • e • (A^ ax • 7^ lax + 1) • T^ lax • Inn. 

Similar to Lemma [1] there is a resource that is accessed by at least 7j^ ax 
transactions of A\ for write so that all these transactions have to be serialized 
because of the conflicts. Therefore, the optimal makespan is bounded by: 

makespan opt {Al) > ■ r 3 min . 

By combining the upper and lower bounds, we obtain a bound on the com- 
petitive ratio: 

( AJ\ ma keSpani\f on — Clairvoyant{A^) 

O -T17V on — Clairvoyant ) — " 7~7T\ 

makespan opt [A+ ) 

— 3 3 

Tmax ' r min 

<32-e-(A4 ax + l)-lnn 
< 64 • e • A£ iax ■ In n, 

I Ai i 

with probability at least 1 — -^j- 1 -. 
Proof of Lemma [7j 

Proof. Since for any transaction T E A?, d T < \Nt\ < |^| — 1, similar to the 

proof of Lemma El with probability at least 1 — , the makespan is bounded 
by: 

(A?) < 16-e - \ A{\ ■ r£ ax • Inn. 
Similar to Lemma [21 the optimal makespan is bounded by: 

makespan opt {Al) > \3tl^M . ^ 



When we combine the above bounds of the makespan we obtain a bound on 
the competitive ratio: 

rR , .j, _ makespan Non - C iairvoyant(Al) 

makespan opt (A! ) 

~ |Aj|-/3-A J max j ~ \3 



nun 



I A j I 

with probability at least 1 — —^r- 
Proof of Lemma [8j 

Proof. Since A^ nax = (2- J+1 — 1), Corollary [3] gives for each subgroup A\ compet- 
itive ratio 

CR Non ^ciairvoyant{A{) < 64 • e ■ min ^2 1+l - 1, y+i^ 1 1 ' lnn 
< 128-e-min|2 J , ^| - Inn, 

A 3 I • m 

with probability at least 1 — —t-. Following the proof steps as in Lemma [3J we 
obtain: 

C Rn on — Clairvoyant 

(Ai) < 512 -e- J- Ann. 

\ A j \ mi 

This bound holds with with probability at least 1 — — = 1 — J— since 

e;=oI4i = ki. 

Proof of Theorem [2} 

Proof. As there are ^ groups of transactions A il and one group £?, in the worst 
case, Algorithm Non-Clairvoyant will commit the transactions in each group ac- 
cording to their order starting from the lowest order group and ending at the 
highest order group. Clearly, the algorithm will execute the read-only transac- 
tions in group B in optimal time. Therefore, using Lemma |8] we obtain: 

t 

C 'Rn on — Clairvoyant on— Clairvoyant 

(Ai) + CRNon — Clairvoyant (B) 

i=l 

e-i . — 
< V512 • e • J- ■ Inn + 1 

4=0 V ^ 

= 512- e-£- J^-lnn+1, 
with probability at least 1 — ^'="J* 4 ' = 1 — nT 1 , since Yli=o = \T\ = n - 



